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Abstract 

These notes deal with various kinds of distance functions and re- 
lated properties and measurements for sets and functions. 
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1 Basic notions 

Let M be a nonempty set. By a distance function on M we mean a real- 
valued function d(x, y) denned for x,y G M such that (i) d(x, y) > for all 
x, y G M, (ii) d(x, y) = if and only if x = y, and (hi) 

(1.1) d(y,x) = d(x,y) 

for all x, y G M. If moreover the triangle inequality holds, which is to say 
that 

(1.2) d(x, z) < d(x, y) + d(y, z) 
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for all x,y,z G M, then the distance function d(-, •) is said to be a metric on 
M. More generally, if there is a real number C > 1 such that 

(1.3) <C(d(z,y) + %,z)) 

for all x,y,z G M, then the distance function •) is said to be a quasimetric 
on M, with constant C. 

A basic example occurs with the real numbers R equipped with the stan- 
dard distance function. Recall that if x is a real number, then the absolute 
value of x is denoted \x\ and is defined to be equal to x is x > 0, and to be 
equal to — x if x < 0. Thus the absolute value of a real number is always a 
nonnegative real number, the absolute value of a real number is equal to 
if and only if the real number itself is equal to 0, and the absolute value of 
a product of two real numbers is equal to the product of the absolute values 
of the two real numbers individually. Furthermore, 

(1.4) \x + y\ < \x\ + \y\ 

for all real numbers x, y, which is also referred to as a triangle inequality. It 
follows easily that the standard distance function \x — y\ on the real numbers 
is in fact a metric. 

A distance function d(x, y) on a nonempty set M is said to be an ultra- 
metric if 

(1.5) d(x, z) < m&x(d(x, y), d(y, z)) 

for all x,y,z G M. Thus an ultrametric is automatically a metric. One might 
consider the weaker condition that there be a real number C > 1 such that 

(1.6) d(x, z) < C' max(d(x,y),d(y, z)) 

for all x,y,z G M. Of course this would imply that the distance function 
is a quasimetric, and conversely this condition holds for quasimetrics with 
C equal to 2 times the quasimetric constant. In other words, this weaker 
condition is equivalent to saying that the distance function is a quasimetric, 
except perhaps with a slightly different constant. 
Let q be a positive real number. If q > 1, then 

(1.7) (a + b) q < 2 q - x [a q + b q ) 

for all nonnegative real numbers a, b. This amounts to the convexity of the 
function t q on the nonnegative real numbers. It follows that the distance 
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function \x — y\ q on the real line is a quasimetric with constant 2 9_1 when 
q > 1. More generally, if d(x,y) is a quasimetric on a nonempty set M, 
then d(x,y) q is also a quasimetric on M when g > 1, where the quasimetric 
constant C for d(x,y) is replaced with 2 q ~ 1 C q . 

Now suppose that < q < 1. In this case we have that 

(1.8) (a + 6) 9 < a 9 + 6 9 
for all nonnegative real numbers a, b. Indeed, 

(1.9) a + b < 

< 

Thus we get in this case that the distance function \x — y\ q on the real line 
is a metric when < q < 1. If d(x, y) is a quasimetric on a nonempty set M 
with constant C, then d(x, y) 9 is a quasimetric on M with constant C q when 
< q < 1, and in particular cf(x, y) 9 is a metric on M if d(x, y) is. 

For ultrametrics the situation is much more straightforward. Namely, 
if d(x, y) is an ultrametric on a nonempty set M and q is a positive real 
number, then d(x,y) q is also an ultrametric on M. If more generally d(x,y) 
is a distance function on M which satisfies (J1.6J) for some C" > 1, then d(x, y) q 
satisfies (jl.6j) with C replaced with {C') q . 

Let d(x, y) be a distance function on a nonempty set M. We say that 
d(x, y) is almost a metric with constant C > 1 if 

(1.10) d(w ,wi) < C^2d(wj,Wj^i) 

3=1 

for any finite sequence of points Wo, w\, . . . ,wi in M. If we restrict our at- 
tention to the case where 1 = 1, then this is the same as the quasimetric 
condition. If d(x,y) is a metric, then it is also almost a metric with C = 1, 
by repeating the triangle inequality. However, if q is a positive real num- 
ber such that q > 1, then the distance function \x — y\ q on the real line is 
a quasimetric but it is not almost a metric, as one can check using simple 
examples. 

Suppose that d(x, y) is a distance function on a nonempty set M which 
is almost a metric with constant C . Define p(x, y) for x, y G M to be the 



(a 9 + b q ) max(a 1 - 9 , 6 1 " 9 ) 
(a 9 + 6 9 )max(a 9 ,6 9 ) (1 - 9)/9 
(a 9 + 6 9 ) (a q + b q Y 1 - q)/q 
(a q + b q ) 1/q . 
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infimum of the sums X^ = i d(vjj, Wj-i) over all finite sequences w , . . . ,wi of 
points in M such that Wq = x and wi = y. We have that 

(1.11) C- 1 d{x,y)<p{x,y)<d{x,y) 

for all x, y G M, where the first inequality follows from the assumption 
that d(x, y) is almost a metric with constant C, and the second inequality 
is automatic from the definition of p(x,y). It is easy to see that p(x,y) is 
a distance function on M, and in fact it is a metric, because the triangle 
inequality for p(x, y) is also an automatic consequence of the definition of 
p(x,y). 

If d(x, y) is a distance function on a nonempty set M, then we say that 
d(x, y) is almost an ultrametric with constant C > 1 if 

(1.12) d(w ,wi) <C'max.(d(w ,w 1 ),...,d(wi- 1 ,wi)) 

for any finite sequence of points wq, . . . , Wi in M. When / = 1 this is the same 
as (|1.6|) . and an ultrametric is almost an ultrametric with constant equal to 
1. On the real line, the distance function \x — y\ q is not almost an ultrametric 
for any positive real number q, as one can verify. 

Let d(x, y) be a distance function on a nonempty set M which is almost an 
ultrametric with constant C . Define o~(x, y) for x, y G M to be the infimum 
of 

(1.13) max(d(w , Wi),..., d(wi-i, Wi)) 

over all finite sequences Wo, . . . , wi of points in M with wq = x and wi = y. 
Thus 

(1.14) {CT 1 d(x,y)<a{x,y)<d(x,y) 

for all x,y G M, where the first inequality follows from the assumption that 
d(x,y) is almost an ultrametric with constant C, and the second condition 
is immediate from the definition. Clearly o~(x, y) is a distance function, and 
in fact it is an ultrametric, because it satisfies the ultrametric version of the 
triangle inequality by construction. 

Suppose that M is a nonempty set, E is a nonempty subset of M, and 
that d(x, y) is a distance function on M. One can restrict d(x, y) to E to 
get a distance function on E. If d(x, y) is a metric, quasimetric, ultrametric, 
etc., on M, then the same condition will hold automatically on E as well. 
However, depending on the structure of E, it may be that the restriction of 
the distance function to E satisfies conditions that do not work on M. 
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As a basic example, let M be the real line, and let E be the usual Cantor 
middle-thirds set, as in By definition, one starts with the unit interval 
[0, 1] in the real line, consisting of all real numbers x such that < x < 1, 
one removes the open middle third of this interval, leaving the two intervals 
[0, 1/3] and [2/3, 1], one removes the open middle thirds of these two intervals 
to get four closed intervals of length 1/9, and so on. At the kth stage of the 
construction one has a set consisting of 2 h closed intervals of length l/3 fc , and 
the union of these intervals can be denoted E^. By construction, Ek+i C 
for all k, and the Cantor set E is defined to be the intersection of all of the 
Ek's. The restriction of the usual distance function \x — y\ to the Cantor set 
is almost an ultrametric. 

For both the notions of a distance function d(x, y) being almost a metric 
or almost an ultrametric, one can consider more general conditions with 
a lower bound in terms of some function of d(x,y), in place of simply a 
positive multiple of d(x, y). One could still define p(x, y) or a(x, y) as before, 
with p(x, y), a(x, y) < d(x, y) automatically, and with lower bounds in terms 
of a function of d(x,y). One would again have that p(x,y) is a metric or 
that a(x,y) is an ultrametric, as appropriate, by construction. This type of 
condition occurs for instance with more general Cantor sets, in which the 
sizes of the open intervals being removed are more variable. Of course these 
sizes are also relevant for the constants involved even if one has lower bounds 
in terms of multiples of the original distance function. 

Remark 1.15 A result of Maci'as and Segovia ^2] states that for each pos- 
itive real number C\ > 1 there are positive real numbers a, C2 with a < 1 
and C2 > 1 such that the following is true. Let M be a nonempty set and 
d(x, y) a distance function on M. If d(x, y) is a quasimetric with constant 
Ci, then d(x,y) a is almost a metric with constant Ci- 

Let n be a positive integer, and let R n denote the usual space of n-tuples 
of real numbers. Thus x 6 R" can be written explicitly as x — (yX\^ . . . , x n )j 
where each component Xj, 1 < j < n, is a real number. If x, y G R n , then 
x + y is defined to be the element of R n whose jth component is equal to the 
sum of the jth components of x, y. If x G R™ and t is a real number, then the 
scalar product t x is defined to be the element of R n whose jth component 
is equal to the product of t and the jth component of x. In this manner R" 
is a real vector space of dimension n. 
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Let N(x) be a real-valued function on R n such that N(x) > for all 
x G R n , N(x) = if and only if x — 0, and 

(1.16) N(tx) = \t\N(x) 

for all real numbers t and all x G R™. If also N(x) satisfies the triangle 
inequality 

(1.17) N(x + y) < N(x) + N(y) 

for all ij6 R n , then N is said to define a norm on R n . In the presence of 
the other conditions, one can check that the triangle inequality is equivalent 
to the convexity of the closed unit ball associated to N. In other words, 
in the presence of the other conditions, N is a norm if and only if for each 
i,H 6 R n with N(x), N(y) < 1 and each real number t with < t < 1 we 
have that N(tx + (1 — t) y) < 1 too. As a generalization of the triangle 
inequality, if iV satisfies the first set of conditions, then we say that iV is a 
quasinorm with constant C > 1 if 

(1.18) N(x + y) < C(N(x) + N(y)) 
for all x, y G R n . 

As a basic family of examples, let p be a positive real number, and define 
\\x\\ p for x G R™ by 



(1.19) 




Let us extend this to p = oo by putting 

(1.20) ||a;||oo = max(|zi|, . . . , \x n \). 

These functions clearly satisfy the first set of conditions mentioned in the 
previous paragraph, namely \\x\\ p is a nonnegative real number which is equal 
to exactly when x — 0, and \\t X 1 1 <p IS equal to the product of \t\ and ||x|| p 
for all real numbers t and x G R n . 

When p = 1, oo it is easy to see directly from the definition that ||x|| p 
satisfies the triangle inequality and is therefore a norm on R™. This is also 
true when 1 < p < oo, as one can see by checking the convexity of the closed 
unit ball of ||x|| p . To be more precise, one can reduce this to the convexity 
of the function u p on the nonnegative real numbers when p > 1. When 
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< p < 1 and n > 2, \\x\\ p does not define a norm on R n . It does define a 
quasinorm, and indeed one has that 



(1-21) \\x + y\\ p <\\x\\l+\\y 



for all x,y G R n when < p < 1. 

If 7V(:r) is a norm on R n , then iV(a; — y) defines a metric on R n . If N{x) 
is a quasinorm on R n , then N(x — y) is a quasimetric on R n . For with 
< p < 1 we have that ||x — y\\ p defines a metric on R n . 

For any positive real number p and x G R n we have that 

(1.22) Halloo < M\p- 
Conversely we also have that 

(1.23) IMI P < IMU 

and in particular \\x\\ p tends to H^Hoo as p — > oo. More generally, if p, g are 
positive real numbers with p < q, then 

(1.24) NI,<N P 

for all x G R n , as one can show using the q = oo case. One also has that 

(1.25) ||a:|| p < n (1/p) " (1/9) \\x\\ q , 

which can be verified using the convexity of the function u q/p on the nonneg- 
ative real numbers. 



2 Lipschitz conditions 

Let M be a nonempty set, and let d(x,y) be a real-valued function defined 
for x, y G M. We say that d(x, y) is a semidistance function if y) > 
and d(y,x) = d(x,y) for all x,y G M. If also d(x,y) satisfies the triangle 
inequality, so that 

(2.1) d(x,z)<d(x,y)+d(y,z) 

for all x,y,z G M, then c?(a;, y) is said to be a semimetric. A real-valued 
function jV(x) on R n is called a seminorm if iV(x) > for all x G R n , 
N(tx) = \t\ N(x) for all real numbers t and all x G R n , and 

(2.2) N(x + y) < N(x) + N(y) 
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for all x, y G R n . If N(x) is a seminorm on R", then N(x — y) is a semimetric 
on R™. 

Suppose that M is a nonempty set and that d(x, y) is a semidistance 
function on M. Define p(x, y) for x, y G M to be the infimum of the sums 

i 

(2.3) XX^i'^i-i) 

3=1 

over all finite sequences Wi, . . . ,wi of points in M such that w = x and 
wi = y. Thus p(x, y) is a semidistance function on M such that 

(2.4) p(x,y)<d(x,y) 

for all x, y G M, and in fact p(x, y) is a semimetric because it satisfies the 
triangle inequality by construction. Conversely if T(x,y) is a semimetric on 
M such that r(x,y) < d(x,y) for all x, y G M, then r(x,y) < p(x,y) for all 
ij6 M. 

Let Ml, M 2 be nonempty sets with semidistance functions di, d 2 , respec- 
tively. If / is a mapping from Mi to M 2 and L is a nonnegative real number, 
then we say that / is L-Lipschitz with respect to these semidistance functions 
if 

(2.5) d 2 (f(x)J(y))<Ld 1 (x,y) 

for all x, y G Mi. 

Notice that for any mapping / : Mi — > M 2 we have that 

(2.6) d 2 (f(x),f(y)) 

defines a semidistance function on Mi, since d 2 is a semidistance function on 
M 2 . If d 2 is actually a semimetric on M 2 , then ()2.6|) is a semimetric on Mi. 
Of course the condition that / be L-Lipschitz is the same as saying that the 
semidistance function (|2.6|) is less than or equal to L times di(x, y). 

Suppose that M 3 is another nonempty set equipped with a semidistance 
function d$, and that f\, f 2 are mappings from Mi to M 2 and from M 2 to 
M3 which are L%, L 2 -Lipschitz for some Li,L 2 > 0, respectively. Then the 
composition f 2 o / 1; which is the mapping from Mi to M 3 defined by 

(2.7) (/ 2 o/ 1 )(x)=/ a (/ 1 (x)) ) 

is Li L 2 Lipschitz, i.e., one multiplies the Lipschitz constants. 
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As a special case, let M be a nonemtpy set with a semidistance function 
d(x, y) , and let / be a real- valued function on M. We say that / is L-Lipschitz 
for some L > if 

(2.8) \f(x)-f(y)\<Ld(x,y) 

for all x, y G M. This is the same as saying that / is L-Lipschitz as a mapping 
from M into the real line equipped with the standard metric \s — 1\, s,t G R. 

For a real-valued function / on M the property of being L-Lipschitz is 
equivalent to 

(2.9) f(x)<f(y) + Ld(x,y) 

for all i,|/6 M. In particular, if d is a semimetric and p is a point in M, 
then the function / p (x) = d(x,p) is automatically 1-Lipschitz. 

By contrast, suppose that we take M to be the real line equipped with 
the quasimetric \x — y\ q where q > 1, and that / is a real- valued Lipschitz 
function with respect to this quasimetric on the domain, which means that 

(2.10) \f(x)-f(y)\<L\x-y\* 

for some L > and all x, y G R. In this event one can check that / must be 
a constant function. 

Let M be a nonempty set with semidistance function d, and suppose that 
R ra is equipped with a seminorm N. Let / be an L-Lipschitz mapping from 
M to R n with respect to the semimetric associated to this seminorm, which 
is to say that 

(2.11) * N(f(x)-f(y))<Ld(x,y) 

for all x, y G M. If t is a real number, then tf(x) is \t\ L-Lipschitz as a 
mapping from M to R n . If j\, f 2 are L 1; L 2 -Lipschitz mappings from M to 
R n , then the sum f\ + / 2 is (L x + L 2 )-Lipschitz. 

Now let us specialize to the case of real- valued functions, and the usual ab- 
solute values on the real line. Specifically, suppose that f±, / 2 are real- valued 
functions on M which are L±, L 2 -Lipschitz, respectively. One can check that 
the maximum and minimum of f\, / 2 are max(L 1; L 2 )-Lipschitz functions 
on M. Under the assumption that fx, / 2 are also bounded, the product of 
/i and f 2 is Lipschitz too. More precisely, if A±, A 2 are nonnegative real 
numbers such that 

(2.12) \fi(x)\<A 1 , \h{x)\<A 2 

for all x G M, then the product of fx and f 2 is (Ai L 2 + A 2 L^-Lipschitz. 



9 



3 Connectedness 



Let M be a nonempty set equipped with a distance function d(x,y), and let 
e be a positive real number. By an e-chain in M we mean a finite sequence 
Wq, . . . , Wi of points in M such that 



for i = 1, . . . , I. A subset E of M is said to be e- connected if for every pair of 
points x,y G E there is an e-chain w , . . . , wi of points in E such that w = x 
and if; = y. 

Thus a subset E of M is not e-connected if there is a pair of points in E 
which cannot be connected by an e-chain. This is equivalent to saying that 
E can be expressed as the union of two nonempty sets A, B such that 



for all u G A and v G B. For in this case there is no e-chain in E connecting 
points in A to points in B. Conversely, if E is not chain connected, so that 
there exist x,y G E which cannot be connected by an e-chain of points in E, 
then one can define A to be the set of points in E which can be connected 
by an e-chain of points in E to x, and B to be the set of remaining points in 
E, and A, B have the properties described earlier. 

Suppose that Mi, M 2 are nonempty sets and that d%, d 2 are distance 
functions on them. Let e > be given, and assume that E is an e-connected 
subset of Mi. Also let / be an L-Lipschitz mapping from Mi to M%. It is easy 
to see that f(E) is then an L e-connected subset of M 2 . There are analogous 
statements for more general "uniformly continuous" mappings, with more 
complicated adjustments to e. 

Let M be a nonempty set with distance function d and let E be a subset 
of M. We say that E is chain- connected if E is e-connected for all e > 0. 
As in the preceding paragraph, the image of a chain-connected set under a 
uniformly continuous mapping is also chain connected. 

A subset E of M is not chain connected if it is not e-connected for some 
e > 0, which is equivalent to saying that there is an e > such that E can 
be expressed as the union of two nonempty sets A, B, with d(u, v ) > e for 
all u G A and v G B. In particular, a subset of a metric space which is not 
chain connected is not connected in the sense discussed in |15j . 

This is equivalent to saying that a subset of a metric space which is 
connected in the sense of [T5] is also chain connected. The converse is not 



(3.1) 



d{wi,Wi„i) < e 



(3.2) 



d(u, v ) > e 
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true in general, however. For instance, the set of rational numbers, as a 
subset of the real line with the standard metric \x — y\, is chain connected 
but not connected. 

Note that a basic result about connected subsets of metric spaces is that 
the image of a connected set under a continuous mapping is connected too. 
For a compact subset of a metric space, one can show that connectedness 
and chain connectedness are equivalent. 

4 Hausdorff content 

Let M be a nonempty set and d(x, y) a metric on M, so that M becomes 
a metric space. A subset E of M is said to be bounded if the set of real 
numbers of the form d(x, y) for x,y G E has an upper bound. The least 
upper bound or supremum of this set of numbers is called the diameter of E 
and is denoted diamE, at least if E is not the empty set. We can also define 
the diameter of the empty set to be equal to 0. If E is a bounded subset of 
M, then the closure E of E, as discussed in [T5] . is also a bounded subset of 
M, and with the same diameter as E. 

Let a be a positive real number. If Ai,...,Ak is a finite collection of 
bounded subsets of M, then consider the quantity 

(4.1) ^(diamAr, 

i=i 

which is a kind of a-dimensional measurement of size of the A^'s. If E is a 
bounded subset of M, then we define 7i a (E), a version of the a-dimensional 
Hausdorff content of E, to be the infimum of the sums (|4.1j) over all finite 
collections A%, . . . , Aj~ of bounded subsets of M such that 

(4.2) EC\jA t . 

i=i 

For instance, we could take k = 1 and A% = E, in which case (14. lj) reduces 
to (dia.mE) a . In particular we automatically have that 

(4.3) H a {E) < (diamEf. 

As a special case we have that T~C a of the empty set is equal to 0. Also, if Ei, 
Ei are two bounded subsets of M such that E\ C £J 2 , then 

(4.4) H a (E 1 ) < H a (E 2 ), 
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because any covering of E 2 by finitely many bounded subsets A\, . . . , A/~ of 
M is also a covering of E\. 

For any bounded subset E of M, 

(4.5) H a (E) = H a (E). 

Indeed, Tt a (E) < TL a (E) simply because E C E. Conversely suppose that 
Ai,...,Ak are finitely many bounded subsets of M such that E is contained 
in the union of the A^s. Then the closure of E is contained in the union of 
the closure of the A^s, and since the diameter of the closure of a set is equal 
to the diameter of a set, it follows that TC a (E) < Tt a (E). 
If E, F are bounded subsets of M, then 

(4.6) n a (EuF) <n a (E) + n a (F). 

To see this, let A\, . . . , A^ and B\, . . . ,Bi be arbitrary finite collections of 
bounded subsets of M such that 

k I 

(4.7) EQ\jAi, FC{J B y 

i=l j=l 

Then the combined collection Ai, . . . , A^, Bi, . . . , B\ is a finite collection of 
bounded subsets of M such that the union of E and F is contained in the 
union of the A^s and Bj's together, and it follows that 

k I 

(4.8) H a (E UF)< ^(diam Ai) a + ^(diam^) Q . 

i=i j=i 

Because A±, . . . , Ak and Bi,...,Bi are arbitrary finite collections of bounded 
subsets of M which cover E and F, respectively, we get (|4.6|) . as desired. 

Now suppose that Mi, M 2 are metric spaces with metrics di, d 2 , respec- 
tively, and that / is an L-Lipschitz mapping from Mi to M 2 . If E is a 
bounded subset of Mi, then 

(4-9) H a M2 (f(E))<L a n a Ml (E), 

where the subscripts Mi, M 2 for 7i a are used to indicate explicitly in which 
metric space one is working. Indeed, if A is any bounded subset of Mi, then 
f(A) is a bounded subset of M 2 , and the diameter of f(A) in M 2 is less 
than or equal to L times the diameter of A in M x . If Ai, . . . , A^ is a finite 
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collection of bounded subsets of Mi such that E is contained in the union 
of the Aj's, then f(A\), . . . , f(A k ) is a finite collection of bounded subsets 
in M2 and f[E) is contained in the union of the /(A^'s. The sum of the a 
powers of the diameters of the /(A^'s in M2 is less than or equal to L a times 
the sum of the a powers of the diameters of the Aj's in Mi, which leads to 

(USD. 
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